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■ Abstract: The minimum entropy production principle provides an approxima- 
, tive variational characterization of close-to-equilibrium stationary states, both for 

macroscopic systems and for stochastic models. Analyzing the fluctuations of the 
empirical distribution of occupation times for a class of Markov processes, we iden- 
tify the entropy production as the large deviation rate function, up to leading order 
when expanding around a detailed balance dynamics. In that way, the minimum 
entropy production principle is recognized as a consequence of the structure of dy- 
namical fluctuations, and its approximate character gets an explanation. We also 

■ discuss the subtlety emerging when applying the principle to systems whose degrees 
. of freedom change sign under kinematical time-reversal. 

o 

1. Introduction 

*no : 

O ■ Over the last century many attempts have been made to give a vari- 

"c^ ational characterization of nonequilibrium conditions. The motivation 

was often found in the successes of variational methods in mechan- 
ics and in equilibrium statistical thermodynamics. Many so called ab 
c$ , initio methods in solid state physics have a variational character. For 

nonequilibrium purposes the best known but also widely criticized vari- 
ational principle, that of the minimum entropy production principle 
(MinEP), goes back to the work of Ilya Prigogine, [18]. In the present 
paper we will restrict us to the version of the MinEP for Markov pro- 
cesses as was first described and proven by Klein and Meijer for some 
specific Markov models, see also [HH El [15] . 

As has been clear since a long time, the MinEP is only valid in 
some approximation. Without doubt, one restriction is that the sys- 
tem must be close to equilibrium, allowing only for a small break- 
ing of the detailed balance condition; that is often referred to as the 
regime of irreversible thermodynamics. Yet, the situation is more sub- 
tle and there have appeared examples in the literature violating the 
MinEP even close to equilibrium, [HI E]. The situation is even more 
complicated and downright controversial when dealing with examples 
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of macroscopic physics, where both positions and velocities mix and 
things appear to depend on the level of coarse graining. 
At any rate and because of the enormous advantages of variational 
characterizations, there has been a continued interest in the nature of 
Prigogine's MinEP. It remains therefore very interesting to see if the 
principle can be understood, not only by direct verification as was done 
in pm [15], but also from the context of fluctuation theory. After 
all, also in equilibrium statistical physics there is an intimate relation 
between the variational principle characterizing equilibrium and the 
structure of equilibrium fluctuations. The very reason why thermody- 
namic equilibrium is characterized by maximum entropy or, depending 
on the context, by minimum Helmholtz or Gibbs free energy, is exactly 
because these thermodynamic potentials also appear as rate functions 
in the exponents governing equilibrium probabilities. 

We show in this paper that a relation exists between the MinEP and 
the structure of steady state fluctuations for Markov processes. Our 
main finding is that the entropy production naturally emerges when 
analyzing the fluctuations of occupation times, first studied in the gen- 
eral context of the theory of large deviations by Donsker and Varad- 
han [1] . We show that in the close-to-equilibrium regime and when the 
state variable is even under time- reversal, the Donsker- Varadhan (DV-) 
functional coincides to leading order with the entropy production rate. 
When the state variables are odd under time-reversal, such as for the 
electric current in the famous counter example of [HI E] , that affine re- 
lation between entropy production and the DV-functional is no longer 
valid. It remains of course generally true that the variational princi- 
ple associated with the DV-functional is a valid generalization of the 
MinEP. Yet, a useful scheme for the computation of the DV-functional 
for processes far from equilibrium remains an open problem. 

The structure of the paper is as follows. In Section [2] we present a 
brief introduction to the large deviation theory of occupation times. In 
the mathematical details we often restrict ourselves to the case of con- 
tinuous time and irreducible Markov processes on a finite state space. 
Many of the arguments have however a larger validity. For example, 
for a detailed balanced dynamics the DV-functional can be computed 
explicitly; we review that result in Section [3] with a new proof that is 
not restricted to finite state spaces. 

Our main result follows from a perturbative evaluation of the DV- 
functional close to equilibrium and is contained in Section HI first on 
a formal and general level and then rigorously for finite state space. 
In Section [5] we explain how and when the leading order of the DV- 
functional is related to the physical entropy production. That relation 
is formulated in our main Theorem 15.11 
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We end with a variety of remarks and conclusions in Section [7J In 
particular, we briefly explain there the situation for Landauer's coun- 
terexample, [HI E] • 



2. Large deviations for the occupation times 

Suppose that (X t )t>o is a stationary ergodic Markov process. For 
most of what follows, we do not need to specify whether it is a jump 
process or a diffusion process, and on what space. Yet, it is sufficiently 
instructive and mathematically non trivial to keep in mind a Markov 
process on a finite space which is irreducible. We are interested in the 
fraction of time that X t spends in some set A of states. Formally, we 
define the empirical distribution pt as 

PT (A) = ^J^ dtS Xt£A (2.1) 

($x t <=A — 1 if X t 6 A and zero otherwise.) As we assume a unique 
stationary measure p, we have that almost surely pr(A) — > p(A) as 
T | +oo, by ergodicity. Yet there are fluctuations around that average 
and we can ask how big they are. That is a subject in the theory of 
large deviations and the answer is given by the asymptotic formula 

P T [p T ~ p] ~ exp[-T/(p)] 

that has to be understood in a logarithmic sense after taking the limit 
T f oo. The rate function I has been found by Donsker and Varadhan 
H El E] in the form 

I(ja)= sup -l^-) (2.2) 

where L is the generator of the Markov process and denotes the 
expectation under the measure p. For a finite state space fl, 

Lg(x) = ^2k(x,y)\g(y)- g(x)], xGfi 
yen 

where k(x, y) > is the rate for the transition x — >• y. 
The Donsker- Varadhan (DV-)functional is always nonnegative, I(p) > 
0, and the equality takes place if and only if p = p is the invariant 
measure. For the precise mathematical formulation we refer to [21 El H]. 

In general, the DV-functional ( 12.21) is not so simple to compute ex- 
plicitly, the main problem of course being to find the maximizer g. An 
important case where the solution has been known and is explicit is 
a reversible (or detailed balance) dynamics. These basic facts are re- 
viewed in the next section; our formulation is slightly more general than 
those provided by the standard references [H [3] . The rest of the paper 
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is then devoted to identifying the leading term in the DV-functional 
for a dynamics breaking the detailed balance. 



3. Detailed balance dynamics 
Suppose that for any pair of real- valued functions <j) and ijj, 



(3.1) 



where (-) p is the expectation under the stationary Markov process. The 
corresponding symmetry of the generator can be obtained under 



lim-(0(x o ) [ip(x T ) -ifj(x )}) p = (<f)Lijj) p 

tJ.0 t 

Theorem 3.1. Under condition (13. ip . the DV-functional is 



fLVf) 



(3.2) 



where f = ^ is the density of jj with respect to the reversible measure 



Remark 3.2. One recognizes the Dirichlet form £}(g,g) = —(gLg) p 
which is related to the spectral gap by 



A= inf 

g:(g) p =0 (g 2 ) p 



(3.3) 



As a consequence, one has the bound 



> A[{f) p - (y/m= A[l - ( Vf) P ] 



(3.4) 



Proof. A standard proof for finite state space can be found e.g. in [3]. 
Here we present a new variant of that argument that works for a general 
(detailed balanced) Markov process. 
From flZ2D , 



1(a) = suplim — 1 
9 >o HO r L 

= sup lim — 1 

9>0 r|o r L 



e rL g 



g l M J 
f(x )g(x T ) 
g(xo) 



(3.5) 



Using reversibility (13.1 1) we subsequently get 

/ f(x )g(x T ) \ = 1 / f(x )g(x T ) f(x T )g(x ) \ 
\ g(x ) lp 2\ g(x ) g(x T ) // 




f(x )g(x T ) _ f(x T )g(x ) 
g(xo) V ^(ar T ) 

+ <V / /(^o)/(^)) p 

> Wf{x Q )f{x T )) p 




(3.6) 



which is an optimal lower bound since the equality is attained if (and 
only if for an irreducible dynamics) g oc y/J. Hence, 

/(//) = lim-[l - (Vfe rL yfi) p ] = -<V?W7>p (3.7) 



rj.0 r 

as claimed. □ 

4. Perturbative evaluation of the DV-functional 

4.1. Formal derivation. Fix a reference detailed balance dynamics 
with generator Lq and with reference measure p°, as in Section [31 For 
a measure /i we write / = ^ for its density with respect to p°. A 
simple computation gives 

*<R.=(-J^ + ^ (41) 

where the adjoint L + is defined by 

(0L^) p o = (^L + 0) p o (4.2) 

on real functions. Hence, searching for the maximizer g* of ( 12.2ft nor- 
malized to (<7*)p0 = 1, we need to solve the equation 

-LLg* = L + 4 (4.3) 
9 9 

Note that for L = L = Lq that equation has a solution g* = v7/ (\ff) P °, 
in agreement with the conclusions of Section [31 

Next, for a close to equilibrium dynamics and for small fluctuations we 
expand L, /, and g in power series, 

L e = L + eL x + e 2 L 2 + ... (4.4) 

r = l + ef 1 + e 2 f 2 + ... (4.5) 

3 £ = 1 + egi + e 2 # 2 + . . . (4.6) 
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and solve (14. 3 p perturbatively. Up to order e it yields 

2L ^ = LoA + L+l 



(4.7) 



which is to be solved under the normalization constraint (gl) p ° — 0. 
That can be achieved as follows. Writing = h e for the density of 
the (presumably unique for small e) stationary measure under L e with 
respect to the reference reversible measure, the stationary equation 
p € L e = can be equivalently written as (L € ) + h e = 0. Expanding again 
h e = 1 + eh\ + . . ., we find that h\ verifies 



and, by definition, (/ii) p o = 0. As a consequence, g\ = (f\ — hi)/2 is 
a solution of (14. 7p . Provided that g* is in fact a global maximum, the 
DV-functional (12.21) becomes, up to leading order, 



r(//) = -^-{/iLo/i - hiL hi + ZLih ~ + o(e 2 ) 

n: nr. (4-9) 



The functional I e itself obviously also depends on e as from (14. 4ft ; we are 
dealing with a dynamics close to a reference reversible dynamics. Ob- 
serve that, since f e dp 6 = h e dp 6 , the leading term in the DV-functional 
(14. 9 p (always for small deviations from equilibrium) resembles the DV- 
functional (13. 2p for the case of detailed balance. In (14.91) that leading 
term is now of order e 2 . 

4.2. Rigorous result. The above formal perturbative argument can 
be justified on a mathematically precise level. In the present section 
we refine the above reasoning by restricting ourselves to the frame- 
work of continuous time Markov dynamics with a finite state space. 
Note that many of the standard nonequilibrium examples of stochastic 
lattice gases or interacting particle systems on a finite graph are thus 
included [HI [12j [20]. Observe also that some precision or justification 
is indeed needed, as one can otherwise construct counter examples to 
the results that will follow. Other "infinite" or "continuous" models 
including diffusion processes, still require additional estimates for a 
proper mathematical treatment, that we are not giving here though; 
we will comment on one important example in Section [7J On the whole 
and perhaps surprisingly, even only to first order around equilibrium, a 
general and mathematically precise identification of the DV-functional 
does not appear easy. 

We fix a finite state space Q, which will serve as vertex set for irre- 
ducible directed graphs respectively with rates k°(x,y) > (reference 
detailed balance) and with rates k e (x,y) > (perturbation) between 
the states x — > y. We assume the reference rates k°(x,y) define an 



Loh, = -L+l 



(4.8) 
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ergodic Markov process with the stationary distribution p° > and 
such that p°(x)k°(x,y) = p°(y)k°(y,x), sufficient for the reversibility 
in (13. ip . The perturbed rates k e (x,y) defined for |e| < e with some 
eo > 0, are assumed to be a smooth modification of the k°(x,y). For 
small enough e the perturbed dynamics is hence ergodic too, with a 
unique invariant distribution p e > which is a smooth modification of 

The modified dynamics has the generator 

Ug(x) = k ^ V) 19(V) - 9(x)} (4.10) 

We further denote 

M +1 = {g > 0; (g) p o = 1} , M s +1 = {g e M +1 ; g{x) >5,xett} 



and we consider the functional 



(4.11) 



J }(9)= ^ p°(z)f(z)k'(z,v)\l-2¥\ 

L 9{x)i 

for / G M s +l on g E M +1 . 

Proposition 4.1. Suppose that f G Mfj for some 5 > 0. For all 
sufficiently small \e\, the functional has a unique maximizer g* e (f) 

in M + i, and g* € (f) \ff j (v7)p°; uniformly in M^. 

Theorem 4.2. If p e is a smooth deformation of p° = p° , then the 
DV-functional / e (/x e ) has a Taylor expansion in e around e = 0, with 
leading term 



The proofs are postponed to Section El 



y dp' y dp' /po 



(4.12) 



5. Relation with entropy production 

We proceed with the physical interpretation of the formula (I4.12p 
for the DV-functional. It will turn out that (14.121) equals the excess of 
entropy production with respect to the stationary entropy production. 
Clearly, to explain, we need some physical context for the dynamics 
itself. However in order to avoid relying solely on concrete examples, we 
can start from the quite general observation that the physical entropy 
production as a variable on path-space is measuring the breaking of 
time-reversal symmetry. That has been argued for at various places, 
see e.g. [T5l [13] and references therein. When the distribution at time 
zero is given by p, then the entropy production over the time interval 
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[0, r], is just the relative entropy of the path-space distribution with 
respect to its time-reversal: 

where (Qu)t = u T -t is the time reversal of the trajectory u and \i T 
is the evolved distribution at time r, i.e., the solution of the Master 
equation -j^ = p, t L, /x = H- Since the process is Markovian, the mean 
entropy production can be written as S T {p) = cr(jit) dt where 

a (u)=\ im tilM (5.2) 

r|0 T 

is the mean entropy production rate. Taken as a functional on distri- 
butions fi, (15.21) is the crucial quantity to be discussed in the present 
section. In particular we can evaluate it under the same conditions as 
for Theorem 14.21 It means that we evaluate the entropy production 
rate in // and that we have a dynamics that is close to equilibrium, in- 
dicated by changing the notation a to a e . The main result of the paper 
is then summarized in the following general and remarkable relation: 



Theorem 5.1. Under the conditions of Theorem 4-2, 

Before we give the proof of that Theorem, we briefly remind the 
reader of the physical context of entropy production, at least within the 
limited set-up of Markov jump processes. We refer to [19], [T71 El [El El] 
for additional material. 

5.1. Entropy production in Markov jump processes. For the 

Markov jump processes of Section l4~2l the entropy production rate ( 15. 2ft 
becomes 

In the case of detailed balance, p(x)k(x,y) = p(y)k(y,x), it is easily 
verified that cr(/x) is the time derivative of the relative entropy: 

= log ^f~\ Y [M^O, y) - v(v)p(y, x )] 

x "\ J y^ x 

~s>$^L ou) 

d Of.. I -M Of.. I ^ \^ ..f„\,„„P( X ) 



-—S(nt\p)\ t=0 , S(p\p) = ^2 fi(x) log ^ 
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When there is a driving away from equilibrium, there is some mean 
entropy production even in the stationary regime. To be specific, as- 
sume that each state x is given an energy E(x) and that the transition 
x <-> y is possible thanks to the interaction with a heat reservoir at 
inverse temperature (3(x,y) = j3(y,x). The rates are taken to satisfy 
the local detailed balance condition 

k ( X ^) = e P(x,y)[E(x)-E(y)) / g g N 

k(y,x) 

For a motivation, see [5j [15]. As a reference we have the Boltzmann- 
Gibbs distribution p(x) oc e~ l3E ^ with j3 some reference inverse tem- 
perature. 

Entropy production rate (15.31) can be split into a contribution which 
is associated to the system and can be written as the time derivative 
of some entropy function, and a part measuring the change of entropy 
in the environment, i.e., 

a(fj) = a s (fj) + <tr(h) 
For the system part we take, compare with f)5.4p . 

H{x)p{y) 



= Y] v(x)k(x,y)\og' 
= -^S(fi t \p)\ t=0 

d {s(vt)-m» t ]\ t=0 



(5.6) 



dt 

with S(p) = — J2 X M x ) 1°S M x ) the Shannon entropy and (E)^ = 
fj l (x)E(x) the mean energy. Hence, as{p) is recognized as {—(3 
times) the rate of change in the free energy. 
The environment part is then 

p(x)k(x,y) 



x ^ x P{y)k{y,x) 



= \J2 [Pfay)-0\ [E(x)-E(y)] [p(x)k(x,y) - p(y)k(y,x)] 

x,y^x 
x,yj^x 

(5.7) 

where {Je(x, y))^ = (Je(?/, x))^ is the mean energy transfer, or heat, 
to the reservoir associated with the transitions x <-» y. In other words, 
<Jr{p) is the change of entropy in the environment plus the term 

P E ( x ^ [v( x ) k ( x i y) - Ky) k (y, x )] = P^ (e)ih U=o 
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which is just the counter term we have subtracted from the system part 

5.2. Proof of Theorem 15.11 Following our general strategy, we com- 
pute (15. 2p by a perturbation expansion around a reference detailed bal- 
anced dynamics. Again, the expansion is mathematically fully justified 
for a finite state space, at least under the conditions of the theorem. 



We split the entropy production rate similarly as in the previous 
section, taking now the invariant distribution p° corresponding to e = 
as the reference: starting from ( 15. 2ft . 



a(p) 



lim -(log -^(^ ) 
t|o r \ dp u 



i dp T , 

log -r-^M + lof 



lim — 

rio r 



log 



dp \ 
dp% 



log 



dp 
dp T 
dp° / 



dp; () 0/ m 



r 1 A dP > 
+ lim — ( log : — ^ 



(5.1 



log d>), T 



du r . du T 
log 



dp dp 



The first term is the limit 

0s(p) = lim - [(log ^) 
rj.o r L \ dp u / n 

= -(Llog/) /1 

Expanding both p = p e and L = L e as in (I4.4l) - (l4.5p . we get 



/' J 



(5.9) 



■ </ £ L e log f)p° = -e 2 {hL Q h +L 1 f 1 ) p0 +o(e 2 ) (5.10) 



Similarly, for the second term in (I5.8p . now denoted by a € R , we have 



^7( l0g Zp4 H " l0g 



dP 



d P T ' e 

rio r\dp° l ° J dP T j? 



dP T '° 



nr(©^; 

dp;r ' 



o(e 2 ) 



lim — / 

rlO T \ dP 



>r,0 



rdp/ 
I dp 



dp e 

oy 



+ lim — 

rjo 2r 



dP T b e 
£!_ 

dP T b° 



dP r ' e 



dP^ 1 



2\ 



of 



-(L £ /%o + A t + o(e 2 ) 
-e 2 (L 1 / 1 ) p o + A e + (e 2 ) 



(5.11) 



where P^b and (-)° refer to the path-space distribution under the 
reference detailed balance dynamics (e = 0) started from p°. The term 
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A e is simply independent of f e . All in all we have found, up to leading 
order, 

= -e 2 {hL f\ + 2L 1 / 1 ),o + A e + o(e 2 ) (5.12) 
Comparing with the result (14.91) or (14. 12ft finishes the proof. 

6. Proof of Proposition 14.11 and of Theorem 14.21 

Let < 5 < 1 be given and fix p by giving / = £ ^I+i- in order 
to localize the maximizer g* € (f) of the functional Jj, we decompose the 
set M + i as follows. Given a, (3 > such that a + (3 < \f~8~ we introduce 



KM = {9£ M +1 ; \g(x) - | < a, x e fi} (6.1) 

Obviously, N^Qi) C Mf 1; and writing [M^] c = M +1 \ we have 
the disjoint decomposition 

M +1 = N*M U [M p +1 \ N^(pi)] U [M^f (6.2) 

In what follows we are going to prove that, choosing e,a,f3 small 
enough, the functional Jj takes its maximum inside N+^fi), and that 
is unique by a local convexity argument. 

We start with a lemma that follows immediately from the assump- 
tions. Recall that the state space is assumed finite; let \Q\ = N. 

Lemma 6.1. There is an irreducible graph G with vertex set Q and 
for which over all edges (x,y), p°(x)k°(x,y) > 7 for some 7 > 0. 
Moreover, k £ (x,y) > ^k°(x,y) for all sufficiently small |e| > 0. 

The next lemma states that when g is outside M+ 1; then Jj(g) can 
be made very negative ((3 I 0). 

Lemma 6.2. For all sufficiently small |e| and for all g G [M^-J , 

J}(g) < C - ~ 7 <5/T^ (6.3) 

with C a constant independent of f , g and e. 
Proof. From the previous lemma, 

g(y) 



J }(9)= ^P°(x)/(x)^(x ?l/ )[l- 

l gyx) 

x,y^x 



g(x) g(y) 



<max2_^k (x,y) - - ^ p u {x)k u (x,y) 

y+x (x,y)£G 

2 l5 \< b 



< c 

(x,y)GG 



(6.4) 
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for a suitable C and e small enough. Since g G [M^] c , there is x G f2 
such that g(x) < [3. Hence there exists a pair (x, y) G G such that 
either g(y) > P~^- 1 g(x) or ^(x) > P~ N - 1 g(y). To see that, assume 
this is not true and denote by l{x) the length of the shortest path in G 
connecting x and x. Then, using l{x) < N — 1, 

(g) p o < maxg(x) < g(x) max/3 v- 1 < (3~ g{x) < 1 (6.5) 

X X 

which is a contradiction. □ 
Now comes the statement that the maximum is also outside \ 

Use the shorthand go = \ff j (y/f) p o. 

Lemma 6.3. For all sufficiently small \e\ > we have Jj(g) < Jf{go) 
whenever g G \ A r ° 1 (/i). 

Proof. As clear from the proof of Theorem 13.14 go is the unique max- 
imizer of J° in M +1 due to the irreducibility assumption. Using that 

-W+i \ A" 1 (/i) is a compact set (in the Euclidean metric, say), 

sup J%g) < J° f (g ) (6.6) 

By the continuity of Jj(g) at e = which is uniform in g G M^, we 
can choose |e| > sufficiently small so that (16. 6p extends to 

sup J}(g) < Jj(g ) (6.7) 

Hence, the lemma follows. □ 

Lemma 6.4. There is a > such that for all sufficiently small \e\ > 0, 
Jj is a strictly concave function in iV" 1 (/^). 

Proof. For any ip : Q — > IR, a direct computation yields 



dt 2 



x,vtx (gg) 

2 



Since the term in the square bracket is strictly positive unless ijj(x)/ijj(y) 
V 70*077(y)j an< ^ usm g Lemma 16.11 the right-hand side in (16.81) is 
strictly negative unless tp oc s/f. In particular, it implies d 2 Jj(go + 
tijj)/dt 2 (t = 0) is a strictly negative quadratic form on the linear sub- 
space defined by (ip) p o = 0. 

By continuity, it first extends to the strict negativity of the quadratic 
form d 2 J Q f (g + t?/0/dt 2 (t = 0) on the same linear subspace and for all 
g G A r " 1 (/i) with some a > 0. Finally, it implies the strict negativity 
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of d 2 Jf(g + ty)/dt 2 (t = 0) on (ip) p0 = for all g E N^(fi) and for all 
sufficiently small |e| > 0. □ 

Proof of Theorem \4-S\ Pick some a > such that Lemma 16.41 holds, 
and fix (3 > to satisfy J^(go) > C — \^/5f3~ 1 ^ 1 for all |e| > small 
enough. Then by Lemmata 16.2116.31 the maximizer of exists and is 
localized in N°i, moreover it is unique by Lemma [6.41 As the a can 
be chosen arbitrarily small (and observe that a j, drives e | 0), one 

also has g* e (ff) ^ go- 

If k e (x,y), x 7^ y G f2, are all differentiable then the maximizer 
g* e coincides with a solution of (14.31) in the domain A^; recall the 
latter necessarily exists and is unique. The perturbative calculation 
in Section 14.11 then follows by an application of the inverse mapping 
theorem. □ 



7. Conclusions: minimum entropy production principle 

7.1. Summary. Our analysis goes beyond merely checking MinEP; 
rather, it enables to view it as a consequence of a dynamical variant 
of the Einstein's formula for equilibrium fluctuations. In simple terms, 
Theorem (15. ip reads that the probability for the empirical distribution 
Pt to coincide with some /x e = p e + 0(e), has the following generic 
structure: 

p T ^(p T ~ //) oc e -ii^n+o^)]+o(T) (71) 

By ergodicity, the maximal probability is obtained for = p e - Ac- 
cording to the above it is also obtained by minimizing the entropy pro- 
duction. Hence, the minimum entropy production principle emerges 
as an immediate consequence of the structure of dynamical fluctua- 
tions. Moreover, its approximate status is also understood since the 
relation between the entropy production and the true DV-functional 
is restricted to the leading order of expansion around equilibrium. A 
systematic perturbation expansion of I e (p e ) would provide corrections 
to that principle; we will not discuss that issue now. Some further 
remarks end the paper: 

7.2. Remarks. 

(1) What has been said so far about the entropy production is sub- 
ject to one further physical condition: that the Markov process 
describes the dynamics of time-reversal symmetric variables. 
Only then are (15.21) or (15.31) correct expressions for the entropy 
production rate. Yet, certain observables like e.g. momentum 
or magnetic field have the property that even a closed system 
dynamics cannot be expected detailed balanced in the sense 
of (13. ip . Instead, a symmetry under time- reversal can only be 
seen when also the sign of these, so called time-reversal odd 
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observables is changed. A deeper reason why such a generaliza- 
tion is needed is that the fundamental equations of motion are 
often second order in time. For processes on variables that are 
odd under time-reversal the above analysis needs a modification 
(see also the next remark). 
(2) We give an example of a Gaussian Markov diffusion process 
(X t ). Suppose a Langevin dynamics of the form 




dX t = (E--yXt)dt + J-£dW t (7.2) 



with standard Wiener process W t . The force 8 is constant and 
7 > is some friction coefficient. For scalar X t G R the process 
is detailed balanced in the sense of (13.11) with respect to p(dx) cx 
exp[— |(x— ^) 2 ] dx, a Gibbs distribution for inverse temperature 
(3. From Theorem 13.11 one easily computes the corresponding 
DV-functional to be 

Is that equal to the entropy production? It now depends on 
whether X t is even or odd under time- reversal. 

Assume first that X t models the position of an overdamped 
oscillator. That is an even variable and the detailed balance (13.11) 
is verified; the stationary process is in equilibrium. The entropy 
production is found most easily from (15.41) : 

^) = 4s ( ,<|p)U = l(<^ (7.4, 

Hence, we get a(fi) = J(/i)/4, consistent with our general re- 
sult. 

Alternatively, suppose now that X t = V t is instead the fluctu- 
ating velocity of a Langevin particle dragged by force S. Al- 
though ( 13. II) remains valid, it no longer expresses time-reversal 
invariance since the kinematical time-reversal (changing the 
sign of the velocity) is not applied. Furthermore, if £ ^ 0, 
then ((f)(vo) , i(j(v T )) p ^ (<j>{— v T ) ip(— vq)) p breaking even a (gen- 
eralized) reversibility. In particular, there is for ^^0a nonzero 
stationary entropy production. That mean entropy production 
can be obtained by the methods of Reference [TB] in the form 
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and is different from (17.41) . Using that (f') p = P(v — -) (U and 
cr(p) = PS 2 / 1 -/, we obtain the following modification of Theo- 
rem I5.lt 

lOM) = ±[(rQi) + <T(p)-2pe{v) lt ] (7.6) 

In particular, the stationary distribution is now found as a min- 
imizer of the functional er(/i) — 2/3£(v) IJ ,. Equivalently, since 
a(p) = (38(v) p , the stationary measure is now characterized by 
a (constrained) maximum entropy production principle: 

max{ cr(^) | a(fi) = (3£{v)^} = a(p) (7.7) 

(3) The above also provides an explanation for the counter example 
to MinEP given by Landauer, [8], [9]. There one considers an 
electrical circuit with resistance R, inductance L, and voltage 
source S in series. The physical entropy production is a(j) = 
/3Rj 2 , corresponding to the Joule heat caused by the current j 
through the resistance R. Apparently, the stationary current 
j* = S/R does not coincide with the minimum of the entropy 
production. 

To understand the situation, we embed the network dynamics 
in a stochastic process by combining Kirchhoff 's second law with 
the Johnson-Nyquist noise voltage on the resistance to get the 
equation 

d Jt = \{Z- Rh) dt + (7.8) 

(the Nyquist prefactor for the noise being determined from 
the fluctuation-dissipation relation). That is a linear Langevin 
equation of the form (17.21) for the current which is odd under 
time-reversal. Hence, the conclusion of the previous remark ap- 
plies and, in particular, both Theorem 15.11 and MinEP are no 
longer valid. Yet, we can obtain the correct variational princi- 
ple from the DV-functional. 
Consider indeed the functional 

IQ) = inf {JO*) | (j), = j} (7.9) 



which, by the contraction principle, is the large deviation rate 
function for the empirical average jr = ^ Jo Jt dt as T | +oo, 

P[j T ~]]ocexp[-J7(J)] (7.10) 

Here it is easy to compute from (17. 3ft : 

'o>-¥(j-D" <-> 
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and that is then also the corrected variational functional to 
consider. 

For other examples and for further details we refer to PQ. 

(4) Our result as formulated in Theorem 15.11 is no longer valid if 
we are away from the perturbation regime and the assumptions 
are not verified. As an example, consider again a Markov dy- 
namics on a finite state space Q and let \x be a distribution 
supported in some f2 £ fl, i.e., /i(x) = for all x G Q \ fV 
As one immediately checks from (15. 3p . cr(fi) = +oo whenever 
there are some x G f2 , y G Q \ f2 such that /j J (x)k(x,y) ^ 0. 
On the other hand, the DV-functional is bounded: < 
max x J2 y ^ x k(x,y). 

(5) The Donsker-Varadhan theory is not restricted to the time- 
averages in the sense of (12.11) . More generally, one can study 
fluctuations along a discrete sequence of observations with a 
time interval r between the observations. The time-averages 
are then of the form 

~(F(X T ) + F(X 2T ) + . . . + F(Xt-t) + F(X T )) 

and we are concerned with their large deviations along the limit 
T = nr ] +oo. For every r there is a rate function / T (/i). The 
case (12.21) corresponds to lim T |o 7 t ()u)/t = Obviously, one 

can investigate the close-to-equilibrium behavior for every one 
of these cases and, in principle, one obtains for each of them a 
variational principle. 
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